# Low VRAM requirement

Qwen3 8B FP8 Dynamic
Apache-2.0
Qwen3-8B-FP8-dynamic is an optimized version of the Qwen3-8B model through FP8 quantization, significantly reducing GPU memory requirements and disk space usage while maintaining the original model's performance.
Large Language Model Transformers
Q
RedHatAI
81
1
Hidream I1 Fast Nf4
MIT
HiDream-I1 is an open-source image generation foundation model with 17 billion parameters. The 4-bit quantized version can run on 16GB VRAM, enabling fast and high-quality image generation.
Image Generation
H
azaneko
19.22k
7
Wan2.1 I2V 14B 720P Diffusers
Apache-2.0
Wan2.1 is a comprehensive open-source video foundation model with top-tier performance, supporting consumer-grade GPUs, multi-task capabilities, visual text generation, and efficient video VAE.
Video Processing Supports Multiple Languages
W
grnr9730
96
0
Wan2.1 T2V 14B
Apache-2.0
Wan2.1 is an open and advanced large-scale video generation model that supports various tasks including text-to-video and image-to-video, compatible with consumer-grade GPUs.
Text-to-Video Supports Multiple Languages
W
Isi99999
6,470
0
Mistral Small 24B Instruct 2501 GPTQ G128 W4A16 MSE
Apache-2.0
This is the 4-bit quantized version of the mistralai/Mistral-Small-24B-Instruct-2501 model, quantized by ConfidentialMind.com, achieving a smaller and faster model with minimal performance loss.
Large Language Model English
M
ConfidentialMind
93
1
Svdq Int4 Flux.1 Schnell
Apache-2.0
INT4 quantized version of FLUX.1-schnell, enabling efficient text-to-image generation with SVDQuant technology
Text-to-Image English
S
mit-han-lab
20.14k
9
Llama 3.2 1B Instruct FP8
FP8 quantized version of Llama-3.2-1B-Instruct, suitable for multilingual business and research applications, with performance close to the original model.
Large Language Model Safetensors Supports Multiple Languages
L
RedHatAI
1,718
3
Meta Llama 3.1 405B Instruct FP8 Dynamic
FP8 quantized version of Meta-Llama-3.1-405B-Instruct, suitable for multilingual commercial and research purposes, specially optimized for assistant robot scenarios.
Large Language Model Transformers Supports Multiple Languages
M
RedHatAI
97
15
Meta Llama 3.1 8B Instruct FP8
FP8 quantized version of Meta-Llama-3.1-8B-Instruct, suitable for multilingual business and research applications, specially optimized for assistant-like chat scenarios.
Large Language Model Transformers Supports Multiple Languages
M
RedHatAI
361.53k
42
Dreamshaper Xl Lightning
An efficient text-to-image generation model fine-tuned based on Stable Diffusion XL, supporting rapid generation of artistic images
Image Generation Supports Multiple Languages
D
Lykon
10.57k
59
Sotemixv2
Openrail
SoteMix V2.1 is a high-resolution text-to-image model based on Stable Diffusion, specializing in artistic and anime-style image generation.
Image Generation Supports Multiple Languages
S
Disty0
25
3
Lcm Lora Ssd 1b
MIT
A text-to-image generation model fine-tuned from SSD-1B using LCM-LoRA technology, supporting rapid generation of high-quality images
Text-to-Image
L
openskyml
73
1
Tiny Sd
Openrail
A lightweight text-to-image generation model optimized through distillation based on Realistic_Vision_V4.0, achieving 80% faster speed than base SD1.5
Image Generation
T
segmind
23.05k
63
Llava 13b V0 4bit 128g
LLaVA is a multimodal model combining vision and language, based on the LLaMA architecture, supporting image understanding and dialogue generation.
Text-to-Image Transformers
L
wojtab
167
79
Gpt J 6B 8bit
Apache-2.0
This is the 8-bit quantized version of EleutherAI's GPT-J 6B parameter model, optimized for running and fine-tuning on limited GPU resources (e.g., Colab or 1080Ti).
Large Language Model Transformers English
G
hivemind
176
131
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase